A time-delay neural network architecture for isolated word recognition

نویسندگان

  • Kevin J. Lang
  • Alexander H. Waibel
  • Geoffrey E. Hinton
چکیده

-A translation-invariant back-propagation network is described that performs better than a soph&ticated continuous acoustic parameter hidden Markov model on a noisy, lO0-speaker confusable vocabulary isolated word recognition task. The network's replicated architecture permits it to extract precise information from unaligned training patterns selected by a naive segmentation rule. Keywords--Isolated word recognition, Network architecture, Constrained links, Time delays, Multiresolution learning, Multispeaker speech recognition, Neural networks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Time-derivative Neural Net Architecture - an Alternative to the Time-delay Neural Net Architecture

Though the time-delay neural net architecture has been recently used in a number of speech recognition applications, it has the problem that it can not use longer temporal contexts because this increases the number of connection weights in the network. This is a serious bottleneck because the use of larger temporal contexts can improve the recognition performance. In this paper, a time-derivari...

متن کامل

Multi-State Time Delay Neural Networks for Continuous Speech Recognition

Alex Waibel Carnegie Mellon University Pittsburgh, PA 15213 [email protected] We present the "Multi-State Time Delay Neural Network" (MS-TDNN) as an extension of the TDNN to robust word recognition. Unlike most other hybrid methods. the MS-TDNN embeds an alignment search procedure into the connectionist architecture. and allows for word level supervision. The resulting system has the ability to ma...

متن کامل

A connectionist recognizer for on-line cursive handwriting recognition

In this paper we show how the Multi-State Time Delay Neural Network (MS-TDNN), which is already used successfully in continuous speech recognition tasks, can be applied both to online single character and cursive (continuous) handwriting recognition. The MS-TDNN integrates the high accuracy single character recognition capabilities of a TDNN with a non-linear time alignment procedure (dynamic t...

متن کامل

Performance Through Consistency: MS-TDNN's for Large Vocabulary Continuous Speech Recognition

Connectionist Rpeech recognition systems are often handicapped by an inconsistency between training and testing criteria. This problem is addressed by the Multi-State Time Delay Neural Network (MS-TDNN), a hierarchical phonf'mp and word classifier which uses DTW to modulate its connectivit.y pattern, and which is directly trained on word-level targets. The consistent use of word accuracy as a c...

متن کامل

A Comparative Study of the Multi-Layer Perceptron, the Multi-Output Layer Perceptron, the Time-Delay Neural Network and the Kohonen Self-Organising Map in an Automatic Speech Recognition Task

This paper describes a study of the use of four different neural network techniques for automatic speech recognition (ASR) using two common, real-world application databases. The neural network techniques investigated were the Multi-Layer Perceptron (MLP), the Multi-Output-Layer Perceptron (MOLP), which is an improved version of the MLP , the Time-Delay Neural Network (TDNN) and the Kohohnen Se...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Neural Networks

دوره 3  شماره 

صفحات  -

تاریخ انتشار 1990